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REMARKS 



The Section 112. First Paragraph Rejections 



The Examiner has maintained the rejection of claims 1, 2 and 19 under 35 U.S.C. §112, 

First Paragraph. In particular, the Examiner has pointed out the lack of support for the following 

claim language in claim 1 : 

"likelihood being determined by a single value based upon multiple 
criteria .... Multiple criteria comprising natural language likelihood and 
any combination of character row area coordinates, character type . . . and 
character minimum circumscribing rectangle size, " 
[emphasis added by the Examiner] 

In response to the above rejections, the Applicant has provided the Examiner with the 
following disclosures with respect to FIGURE 4 in the original application at line 16, page 6 
through line 22, page 7. In particular, the most relevant portions of the disclosures are 
underlined for the Examiner's convenience. 

The character recognition unit 124 generates corresponding character 
codes as well as other associated information. Other associated 
information includes the character recognition assurance level, the 
coordinates of a minimal circumscribing rectangle and the size of the 
rectangle . The outputs from the character recognition unit 124 are sent to 
a font determination unit 125, the title evaluation point determination unit 
128, a natural language analysis unit 126 and a recognition result storage 
unit 129. The font determination unit 125 determines a font type and other 
associated information for each character and outputs the font information 
to the title evaluation point determination unit 128. Disclosures on the 
font determination in Japanese Patent Laid Publication Hei 9-319830 are 
hereby incorporated by external referenced. The natural language analysis 
unit 126 compares the recognized characters against a predetermined 
dictionary and determines whether or not the recognized characters match 
or resemble any of the predetermined titles or words in a dictionary. For 
example, the dictionary contains a set of predetermined suffixes which 
indicate a noun form and its corresponding statistical information. The 
natural language analysis unit 126 also outputs the determination 
information to the title evaluation point determination unit 128. A 
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characteristics extraction unit 127 extracts information on certain layouts 
such as underlining, centering and the minimal circumscribing rectangle 
size from the input image and outputs the information to the title 
evaluation point determination unit 128. For example, if the character size 
is beyond 18-point in an A4 image, the minimal circumscribing rectangle 
containing the characters is assigned a high score. Similarly, a high score 
is assigned to a minimal circumscribing rectangle if a number of 
characters or words in the rectangle is less than a predetermined number. 
For example, for the Japanese language, the predetermined number of 
characters may be set to twelve. The above and other predetermined 
numbers are user-definable. 

Based upon the information received from the above 
described units, the title evaluation point determination unit 128 
determines the likelihood of containing a title for each minimal 
circumscribing rectangle and expresses the likelihood in terms of a sum of 
points. For example, a higher number of points is generally given to a 
large size character set since a title is usually larger than text. Similarly, a 
higher number of points is generally given for a character set that is 
underlined or otherwise emphasized. A higher point is assigned to the 
natural language determination information indicating a title row such as 
"title:" or "re:." The points are optionally weighted for each category of 
information. Any combination of categories of the above described 
information is also optionally used to determined[sic] the likelihood. The 
title evaluation point determination unit 128 determines whether the 
largest sum is above a predetermined threshold value to qualify an area 
corresponding to the largest sum of points as a title area. A title 
extraction unit 130 extracts the qualified title while a display unit 131 
displays the extracted title. 

Although the Examiner has acknowledged sufficient support for multiple criteria, 
the Examiner maintained the claim rejections under the section 112, First Paragraph. In 
response to the above argument, the Examiner now explicitly requests support for "how 
these multiple criteria are fit in a single value." 



With respect to the above specific question, the Applicant respectfully requests the 
Examiner to reconsider the following portion of the original disclosures in the application. 
First, " the likelihood" is expressed " in terms of a sum of points" as disclosed at lines 12-13 
on page 7. Secondly, "a predetermined number of points" is added to the likelihood for 
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each of the multiple criteria. With respect to FIGURES 6 through 11, it is disclosed that 
"a predetermined number of points is added to the likelihood." In particular, as disclosed at 
lines at lines 24 through 26 on page 8 with respect to FIGURE 6, "[i]f the assurance level is 
above the predetermined threshold value, a predetermined number of points is added to the 
likelihood for the character row area and a title area selection is determined based upon the 
total number of points in act A304." Similarly, as disclosed at lines at lines 5 through 8 on 
page 9 with respect to FIGURE 7, "[i]f the number of characters is below the 
predetermined threshold value in act A405, a predetermined number of points is added to 
the likelihood for the character row area and a title area selection is determined based upon 
the total number of points in act A406." Also, disclosed at lines at lines 17 through 20 on 
page 9 with respect to FIGURE 8, "[i]f the word satisfies predetermined condition in act 
A504, a predetermined number of points is added to the likelihood for the character row 
area and a title area selection is determined based upon the total number of points in act 
A505 " Then, as disclosed at lines at lines 27 through 29 on page 9 with respect to 
FIGURE 9, "[i]f the font information matches the predetermined font data, a predetermined 
number of points is added to the likelihood for the character row area and a title area 
selection is determined based upon the total number of points in act A603 " Also, as 
disclosed at lines at lines 9 through 13 on page 10 with respect to FIGURE 10, "[i]f the 
ratio is above the predetermined threshold value, a predetermined number of points is 
added to the likelihood for the character row area and a title area selection is determined 
based upon the total number of points in act A705 " Lastly, as disclosed at lines at lines 21 
through 24 on page 10 with respect to FIGURE 11, "[i]f the total character width is 
approximately one half of the minimal circumscribing rectangular width, a predetermined 
number of points is added to the likelihood for the character area and a title area selection is 
determined based upon the total number of points in act A806." 

As quoted above, the original application discloses sufficient information on "how 
these multiple criteria are fit in a single value." Based upon the above disclosures, the 
Applicant believes that one of ordinary skill in the relevant art should be able to practice 
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the current invention. For the above reasons, the Applicant respectfully submits to the 
Examiner that the pending rejections under the section 112, First Paragraph should be 
withdrawn. 

The Section 102(e) Rejections 

In the currently pending Office Action, the Examiner has maintained the same rejection 
basis without any explanation despite Applicant's attempt to distinguish the 102(e) rejections in 
the last response filed on December 8, 2003. For this reason, the following is repeated from the 
last response. 

The Examiner has rejected claims 1 through 12 and 15 through 30 and 33 through 36 as 
allegedly being anticipated by the Katsuyama reference under 35 U.S.C §102(e). The Examiner 
has diligently pointed out a particular portion of the cited reference for almost every claimed 
element. However, it appears that the Examiner has failed to do so for "natural language 
likelihood" as explicitly recited by each of independent claims 1 and 19. Independent claims 1 
and 19 each explicitly recite that "said multiple criteria comprising natural language likelihood 
and any combination of . . . In other words, the natural language likelihood is one of the two 
groups of the criteria for ultimately extracting a title. 

The Katsuyama reference discloses a title extracting apparatus based upon the likelihood 
as measured by points. The points correspond to attributes such as "an underline attribute, a 
frame attribute, and a ruled line attribute of each character string rectangle, the positions of the 
character string rectangles in the document image, and the mutual position relation... " The title 
extracting apparatus thus extracts a character string rectangle with the highest points as a title 
rectangle. As disclosed above, the points are only based upon physical attributes of the 
characters in the text. The Katsuyama reference fails to disclose any attributes that are based 
upon natural language characteristics. Thus, the Applicant respectfully submits to the Examiner 
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that the Katsuyama reference fails to anticipate "natural language" that is explicitly recited in 
independent claims 1 and 19. 

Dependent claims 2 through 12 and 15 through 18, 20 through 30 and 33 through 36 
ultimately depend from independent claim 1 or 19 and incorporate the above noted patentable 
feature of the independent claims. Based upon the above patentable distinction, the Applicant 
respectfully submits to the Examiner that the rejections of claims 1 through 12 and 15 through 30 
and 33 through 36 under 35 U.S.C. §102(e) should be withdrawn. 

The Section 103 Rejections 

The Examiner has rejected claims 13, 14, 31 and 32 under 35 U.S.C. §103 as allegedly 
being obvious over the Katsuyama reference in view of the Chen et al. reference. For the lack 
of the disclosure on the determination of the characters end in a noun form and in a set of 
predetermined suffixes, the Examiner has cited the Chen et al. reference at lines 10 through 55 in 
column 15. Then, the Examiner has concluded that it would have been obvious to combine the 
Chen et al. reference with the Katsuyama reference. 

Dependent claims 13, 14, 31 and 32 ultimately depend from independent claim 1 or 19 
and incorporate the above discussed patentable features of the current invention as explicitly 
recited in independent claims 1 and 19. As discussed above with respect to the section 102 
rejections, the Katsuyama reference generally discloses a title extraction device or software 
program. With respect to FIGURE 40, it is described that the likelihood is calculated in points 
based upon the relative position, the height and the frame/underline information. In detail, the 
points are a sum of plus points and minus points that reflect the size, the shape, the mutual 
position and the position in the document. Among the enumerated criteria, the Katsuyama 
reference discloses or suggests no aspects with respects natural language. 
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The Chen et al. reference discloses a technique for recognizing certain keywords based 
upon optical character recognition. However, the Chen et al. reference is not related to the title 
extraction or the title search in the text. After character strings are recognized, a keyword is 
autocorrelated based upon the vector information of the character string. The Chen et al. further 
discloses a Hidden Markov Model (HMM) for statistically identifying the character string based 
upon a sequence of states with probabilistic transitions between states and features associated 
with each state. Each state is described by a probability distribution of the feature vectors that 
characterize the portion of the character. Although the Chen et al. reference discloses a 
recognition technique for identifying certain keywords, the cited reference fails to disclose, teach 
or suggest the title extraction. 



PATENT 



It appears that the Examiner has simply combined the Chen et al. reference with the 
Katsuyama reference without particularly pointing out any particular reason or motivation that is 
disclosed or suggested by either of the cited references. In this regard, the CAFC requires that 
the reference at least suggests motivation or a reason to combine the reference with another 
reference for the purpose of rejecting a claim under the section 103. For example, In re Rouffet, 
149 F.3d 1350, 1357, 47 USPQ2d 1453, 1457-58 (Fed. Cir. 1998). The section 2143.01 of the 
MPEP also details the criteria for combining references as follows: "Obviousness can only be 
established by combining or modifying the teaching of the prior art to produce the claimed 
invention where there is some teaching, suggestion, or motivation to do so found either explicitly 
or implicitly in the reference themselves or in the knowledge generally available to one of 
ordinary skill in the art." 

Because the use of the natural language processing in combination with other physical 
characteristics of text strings in identifying a title in text is not taught, disclosed or suggested, the 
Applicant respectfully submits to the Examiner that it is not proper to combine the Chen et al. 
reference with the Katsuyama reference for the purpose of rejecting the pending claims. 
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Therefore, based upon the above two reasons, the Applicant also respectfully submits to 
the Examiner that the rejection of claims 13, 14, 31 and 32 under 35 U.S.C. §103 should be 
withdrawn. 

Lastly, the Applicant respectfully submits that because the disclosures of Chen cannot be 
properly combined with those of Katsuyama, the above section 103 rejection basis should not be 
applied against independent claims 1 and 19 in the future Office Action. 

Conclusion 

In view of the above amendments and the foregoing remarks, Applicant respectfully 
submits that all of the pending claims are in condition for allowance and respectfully request a 
favorable Office Action so indicating. 

Respectfully submitted, 
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